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ABSTRACT 

We present a new and highly efficient algorithm for computing a power spectrum made from evenly 
spaced data which combines the noise-reducing advantages of the weighted fit with the computa- 
tional advantages of the Fast Fourier Transform (FFT). We apply this method to a 10-year data set 
of the solar p-mode oscillations obtained by the Birmingham Solar Oscillations Network (BiSON) 
and thereby uncover three new low-frequency modes. These are the (=2, n=5 and «=7 modes and 
the f=3, n -1 mode. In the case of the ^=2, «=5 modes, this is believed to be the first such identifi- 
cation of this mode in the literature. The statistical weights needed for the method are derived from 
a combination of the real data and a sophisticated simulation of the instrument performance. Vari- 
ations in the weights are due mainly to the differences in the noise characteristics of the various 
BiSON instiTiments, the change in those characteristics over time and the changing line-of-sight 
velocity between the stations and the Sun. It should be noted that a weighted data set will have a 
more time-dependent signal than an unweighted set and that, consequently, its frequency spectrum 
will be more susceptible to aliasing. 
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1 INTRODUCTION 

Low-angular-degree (low-f) solar acoustic (p) modes travel deep into 
the solar interior and as such can be used as probes of the solar 
core. Results from various completed and ongoing observational pro- 
grammes have revealed a very rich spectrum of low-^ p-modes in a 
frequency range of around 1.0 to 5.5 mHz. An important and ongo- 
ing goal in helioseismology has been the detection of acoustic modes 
with ever lower frequencies. This is because low frequency acoustic 
modes are extremely long lived and so once they are detected their 
frequencies can be determined to very high precision. This enhances 
their diagnostic potential as probes of the solar interior. Unfortunately 
the effects of increasing solar granulation background and decreasing 
amplitude of the modes as one moves lower in frequency makes de- 
tection very difficult. 

The Birmingham Solar Oscillations Network (BiSON) has been 
collecting low-^ data for over 3 decades and as such provides an ex- 
cellent opportunity to detect very low frequency acoustic modes. In 
order to obtain as continuous observations as possible BiSON com- 
prises six stations, separated in longitude around the Earth, each of 
which houses a resonance scattering spectrometer (RSS). However, as 
the network was not built all at once, but rather expanded over time 
from just a single station to the current six, the RSS's were not pro- 
duced to the same specifications. As such, the noise level of the data 
is different at each station. This means that the combined time-series 
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generated from joining together data from all the stations has a noise 
function that varies with time. 



Matters are further complicated by the fact that any Doppler shift 
measurements made by an RSS will have uncertainties and noise char- 
acteristics that depend on the line-of-sight v elocity between the instru- 
ment and the Sun (see IChaplin et ai]|2005l) . The noise level changes 
through the day as the Earth spins and through the year as the Earth 
orbits the Sun. 



This time dependence of the noise suggests that, if we wish to 
make best use of the BiSON data and produce power spectra with the 
lowest possible noise, we should weight the data. If the noise level 
in the power spectra can be significantly reduced by such a method it 
may be possible to increase the signal-to-noise ratio (S/N) by a suf- 
ficient amount to detect more low frequency modes. The S/N needs 
to be assessed for each particular case because it also depends on the 
aliasing that results from multiplying the signal by the weight. Over 
the course of this paper we shall describe how a weighted power spec- 
trum for the BiSON data was produced using a new computationally 
efficient method. This is important when looking at very long time 
series as the traditional sine wave fitting approach would take a very 
long time to compute (of the order of a few weeks even when using a 
high performance computing cluster). We go on to present results on 
how much the S/N is improved and discuss the detection of new low 
frequency modes. 
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2 THE DATA 

In addition to using real BiSON data we have also used simulations 
to inform us about any improvements in S/N and the likelihood of 
detecting new modes. However, for the simulated data to be useful we 
need the noise characteristics to follow closely the real data. The most 
effective way of doing this is to simulate the solar absorption line and 
then run through the acquisition and analysis of the data in the same 
way as is done with the real solar observations. In order to explain 
how this is done we start by briefly explaining how the real data are 
collected. 

At all six statio ns a resonance scattering spectrometer (RSS) (see 
ISrookes et alJll97"^ is used to measure the Doppler velocity shift of 
the 770-nm Dl potassium absorption line. In each case incident solar 
light is passed through a cell containing a vapour of potassium atoms 
and photons of the appropriate energy are resonantly scattered from 
the incident beam. Ideally, detectors placed at right angles to the inci- 
dent beam record only scattered photons. The absorption cross section 
of atoms in the cell is much narrower than that of the solar lines be- 
cause the temperature in the cell is much lower than that of the Sun 
and there is no rotational broadening. This means only light from a 
relatively narrow band of the solar absorption line is detected. 

The vapour cell is placed in a magnetic field, which causes Zee- 
man splitting to occur. The single line is thus split into a multiplet 
with separations dependent on the field strength. The splitting alters 
the atomic states in such a way that atoms will interact with circularly 
polarized light. The blue-shifted transition is sensitive to one hand of 
circular polarization whilst the red-shifted transition is sensitive to the 
other. The splitting also improves the sensitivity of the RSS by mov- 
ing the passbands out onto the wings of the Fraunhofer line where the 
slope is greatest and hence any given line shift will result in a greater 
change in measured intensity. 

A linear polariser and an electro-optic quarter-wave plate are 
used to circularly polarise the incident light and, by switching quickly 
between one-handedness and the other, it is possible to measure the 
light intensity in the blue wing, h and red wing, 7^, almost simultane- 
ously. From these measurements a ratio, R, is formed which gives a 
near-linear proxy for the velocity shift of the solar line: 



To obtain velocity measurements of the solar oscillations, R is plotted 
against the line-of sight station velocity (with respect to the Sun) and a 
third-order polynomial function is fitted. The oscillations signal is then 
recovered by subtracting R from the polynomial and is calibrated using 
the fi tted gradient of R versus the station velocity (see lElsworth et al] 
[l995|). 

This is an idealised situation and in reality background offsets 
and instrumental noise will affect the intensity measure ments used to 
form R. A more realistic expression is thus given by dChaplin et al.l 
l2005h : 

jves ^ jnon ^ jelec ^ _ jres _ jnon _ jdec _ 
^ ~ II" + I'l"" + If' + h + I'," + + If' + 'r ^^"^ 

where is the desired contribution from resonantly scattered light 
and I'""', r'""^ and ; are background sources due to non-resonantly scat- 
tered light, electronic offsets and noise respectively. 

The measurements of the solar oscillations are collected at each 
station each day and combined to form a continuous time series with 
a duty cycle of around 86% (for 2007 and 2008). However, there are 
often occasions when two or even three stations are collecting data 
at the same time. In these cases the usual approach is to use the data 
with the lowest noise characteristics when producing the time series, 
although it is also possible to take an average of the data sets. 



In order to properly characterise how the various noise sources 
propagate through this complex system we use a realistic simulation 
of the BiSON data designed to include many of the instrumental noise 
effects in addition to the actual solar oscillations. The first step in the 
simulation process is to estimate Z"' by fitting a Gaussian profile to 
an observed line obtained using Doppler velocity observations of the 
centre of the solar disc. The observations were taken by the Themis 
solar telescope located at Izana, Tenerife [private communication with 
Rosaria Simoniello, 2008]. The effects of solar rotation, limb darken- 
ing, Doppler imaging and image rotation are all taken into account in 
order to generate a simulated line similar to that seen b y the RSS's 
at each of the BiSON stations (see lBroomhall et al.ll2009l) . The use of 
this more realisti c line shape is an enh ancement on earlier simulation 
work reported by 'Chapli n et aP (l2005l) . 

The 'operating point' on the simulated line is evaluated from the 
Doppler shift due to the changing line of sight velocity of the Sun 
to each station (approximate values for the gravitational red shift and 
convective blue shift are included). Artificial velocity oscillations can 
then be added. Intensity 'measurements' are made for F" and in 
the same way as with real data. At this point estimates of the various 
noise sources can easily be included. Finally, the ratio can be formed 
and analysed in the same way as with real data. 

In order to test the benefits of a weighted power spectrum we 
used a 10-year (3650-day) stretch of BiSON data covering the period 
January 1st 1997 to December 31st 2007. To correspond with this we 
also created a set of simulated data of the same length. In order to bet- 
ter observe any reductions in the background we chose not to add any 
additional noise due to a granulation-like background into the simu- 
lated data. However, the instrumental noise was based on that of real 
BiSON data, in order to make the weighting functions similar. The 
simulated d ata contained an oscillat i ons si gnal created in the manner 
described in' Jimenez-Rey es & et al.l ( l2008h which has been generated 
specifically to mimic the real solar signal. 



3 EFFICIENT WEIGHTED POWER SPECTRUM 

The traditional method of determining a power spectrum from a time 
series of data with known w eights is to perform a sine-wave fit- 
ting (SWF) ana lysis (see e.g., lKieldserj[T993 :l Frandsen et al.|[l995l : 
iNew et alJl2009l) . If all the measurements have equal weight, then the 
power spectrum computed for the evenly-spaced BiSON data takes 
the same form, whether one uses the computationally efficient Fast 
Fourier Transform (FFT), the SWF or t he definition adopted by Scar- 
gle for the periodogram (IScargIelll982l) . In previous publications we 
have always assumed equal weighting of BiSON data and always cal- 
culated the power spectrum using the FFT. 

The SWF is far more computationally expensive than simply tak- 
ing the FFT since the fits must be calculated for each separate fre- 
quency. In fact, to analyse a 10-year time series of BiSON data using 
an SWF method would take many weeks even when using a high per- 
formance computing cluster. As such, this makes the repeated calcula- 
tions needed for scientific analysis very difficult. Therefore, in study- 
ing the effect of applying weighting to BiSON data, we have first 
developed a new method whereby we substitute certain summations 
made during the SWF process with far more computationally efficient 
FFT's. 

Here we describe the main differences between our new method 
and the traditional SWF method. We leave a detailed derivation for 
Appendix 2. For a standard SWF approach, one fits: 

A, sm{l7Tf,h) + B, cosilnffy), (3) 

to the data, yi:, at each frequency, fi, where tk is the time of the kth ob- 
servation. A, and i?, can be evaluated by least squares fitting. The ap- 
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propr iate expressions are (Appendix l: lKieldsenll992l : [Frandsen et alj 



Al- 



and 



B, 



(4) 



(5) 



where sit = sin(2;r/|fA), c,<^ = cos(2nfjtk) and wj. is the statistical 
weight of the kth measurement. The power spectrum can then be ob- 
tained by evaluating + as a function of /. Computing the power 
spectrum in this manner takes considerably longer than computing it 
via an FFT because each unique summation given in the expression for 
A, and S, must be evaluated at each frequency. However, as we show in 
Appendix 2, by using certain properties of the Fourier Transform and 
by making certain (very good) approximations it is possible to rewrite 
the expressions for A, and B, in terms of an FFT and a summation over 
the weights. 



A, 



and 



B, 



-Mm[FFT(w,y,)] 

2(vn72) 



A'Re[FFT(w,>',)] 



(6) 



(7) 



2(w*/2) 

We have checked both the SWF and FFT methods using short time 
series and showed that both returned almost identical results (i.e., to 
within 10"*), for a variety of different realistic weighting functions. It 
should be noted that this method can only be applied to power spectra 
that have been made from evenly spaced data. 



4 DETEIfMINING THE WEIGHTING 

In iNew et alJ l l2009l) it was shown that by using the known noise 
characteristics of the Global Oscillations at Low Frequencies (GOLF) 
instruments on board the Solar and Heliospheric Observer (SOHO) 
spacecraft, it was possible to weight the time-series and hence pro- 
duce a weighted power spectrum. This resulted in an improved signal- 
to-noise ratio (defined as the ratio of the heights of the modes to the 
background and hereafter denoted S/N). The same principle can be ap- 
plied to BiSON data although determining the noise associated with 
the points in the time series is a little more difficult. This is because 
the noise characteristics change both on a daily and yearly basis due 
to a change in the line-of-sight velocity, and because the time series is 
made up from observations from six different stations each of which 
have different noise characteristics and which can also change over 
time due to mechanical wear, breakdown and parts being replaced. 

There are two possible methods of tracking the underlying noise 
characteristics of the various BiSON instruments. The first is to make 
some estimate based on measurements taken directly from the real 
data, and the second is to simulate the observation and analysis of the 
data as accurately as possible and predict the noise based on those 
simulations. In this work we combine both of these methods in order 
to derive as accurate a representation of the noise characteristics as 
possible. 

For the first approach we attempt to estimate the noise character- 
istics of the time series (and hence determine a weighting) by looking 
at the daily power spectrum for each station. The envelope of power 
associated with the strongest acoustic oscillations extends from about 
1.3 mHz to 5.5 mHz. Due to the exponential drop off of the solar 
granulation background the power at frequencies above the regime 
of acoustic modes will be almost entirely due to instrumental noise. 
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Figure 1. Variation in the amplitude of the noise for a BiSON time series for 
the year 2007. The fact that 6 independent traces can be made out highlights 
the fact that the noise level of the various stations remain relatively stable on a 
day-to-day basis, different stations tend to have veiy different noise levels (by 
up to about a factor of 8) and that data from all the stations are used in order to 
make up the full time series. 



While for frequencies below this regime the mean power will be a 
combination of the instrumental noise and the solar granulation back- 
ground. We expect only the instrumental noise to vary significantly 
over time but we still choose to determine the noise characteristics 
from the low frequency regimes. This is because it is known that the 
instrumental noise increases at lower frequencies and it is unknown 
whether this increase will always be proportional to the high frequency 
instrumental noise. Additionally the main aim of reducing the noise 
level and increasing the S/N is to detect weak low frequency acoustic 
modes, so weighting the time series according to the low frequency 
noise would seem the most logical choice. 

Once the daily mean noise values have been determined they can 
be matched to each point in the time series by tracking which data 
point comes from which station for each day. The main drawback of 
this approach is one of precision. Each day's data only lasts for around 
12-13 hours at best (significantly less during winter months and/or if 
there is poor weather). Since we average only over a limited range of 
the power spectrum the variation in estimated noise from one day to 
the next for any particular station can be quite large. This can be alle- 
viated somewhat by smoothing the daily values for each station over 
a number of days (say 60). However this approach runs the risk of 
giving disproportionably higher weightings to bad days of data that 
occur during a stretch of good days and vice versa. A second but re- 
lated problem is that this method cannot be used to track how the noise 
varies throughout the day. Fig. [T] shows the noise in the 0.8-1.3mHz 
range associated with each data point for a BiSON time series for the 
year 2007. 

The second approach to predicting the noise variations is to use 
simulated data created in the manner described in section|2] By adding 
in different noise sources into the simulator it is relatively easy to 
show how the change in the operating point aff ects the level of in- 
strumental noise on a daily and yearly basis (see IChaplin et ai]|2005l : 
iFIetcher et alll2009h . However it is much more difficult to predict the 
magnitude and type of noise for each RSS at the different stations. 
Therefore in creating the simulated data we scale the noise to approx- 
imately match that of the best performance seen in the real data for 
each of the stations. We also use the amount by which the instrumen- 
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tal noise varies throughout the years in order to predict the most likely 
type of noise. 

The big advantages of artificial data are that we can simulate the 
noise in isolation to the modes and also add in an oscillations signal 
with known characteristics. The simulator can also be used to show 
how the noise varies throughout the day. Therefore, we choose to com- 
bine the two methods described above by superimposing the predicted 
daily variations onto the day-by-day noise measurements from the real 
data and hence determine a noise value for each individual data point. 
In summary we can use the measurements of the station noise levels 
to set out long term weight variations and simulations of daily effects 
to set out diurnal weight variations. However, as we go on to explain 
in section (6] including the daily effect seem to have only a negligible 
impact on the resulting S/N of the weighted power spectrum. 



5 ANALYSIS 

In this section we discuss the various ways in which we have analysed 
the power spectrum. For our initial attempt at weighting the time series 
we have assumed the weights, wt, to be the inverse of the noise, where 
the noise is determined for each data point using the method outlined 
in the previous section. The weight is set to zero where there are gaps 
in the time series. 

The best way of determining how much of an improvement is 
seen in the S/N when weighting the data, is to actually compare the 
S/N of the individual modes in power spectra with and without weight- 
ing, and the results of doing this are given in the following section. 
However, it is also a useful exercise to try to make a prior prediction 
of the improvement in the S/N. 

The prediction is based on considerations of the signal and noise 
levels that are expected in the unweighted and weighted power spectra. 
In the unweighted case the time series is constructed from measure- 
ments from the various BiSON instruments. While all such instru- 
ments are expected to measure the same signal level the noise level 
for each instrument is different (strictly, each measurement is differ- 
ent, since the noise level depends also on the line of sight velocity). 
According to Parseval's Theorem the sum of power in a power spec- 
trum is proportional to the variance of the time series used to compute 
the power spectrum, and the constant of proportionality is unity for a 
common definition of the Discrete Fourier Transform' . 

Hence, using this definition, the expected total noise power, b" , 
in an unweighted spectrum obtained from a time series of A' points is: 

YM 

N 



b" 



(10) 



' Strictly, the sum of the power in the power spectrum is related to the sum of 
the power in the time series by: 



J]\X{if =Na^YjX{kf, (8) 
1=0 k=0 

where a is defined by the expression for the DFT. The general form of the DFT 



DFTa-(0 = X(i) = aYj 



-2n jnk/N 



(9) 



Common choices for a are unity (see, for example, pag e 97 of lBrighMiilll988l) 
or jj (see, for example, page 608 of IPress et al.ll2007ll . The latter is used by 
the IDL language, and we have adopted this definition throughout the paper. 
Making this choice for a means that the multiplier on the right hand side of 
Equation 8 is ^ , hence the statement in the main text relating the sum of the 
power in the power spectrum to the vaiiance of the time series. 



where, bj, is the variance attributable to a given point. 

When we weight the data we are essentially transforming a time 
series which is the original multiplied by the weights (see the argu- 
ment at the end of Appendix 2). In that case the standard deviation of 
each point in the weighted time series is Wk ^fbk, and the total noise 
power, b"' obeys: 

2 (hwj) 



b" 



N 



(11) 



In this work we define the weights as the inverse of the noise 
relative to the lowest noise level of any measurement, bi: 

Wk = —■ 

h 

Hence, we can write: 

2 wt 



(12) 



b" 



N 



and, using Bans. [Tol and II2I 

b" = b. 



N 



(13) 



(14) 



We now go on to consider the signal. For a continuous sinusoidal 
signal of amplitude a, the total signal strength in the power spectrum 
will be (as required by Parseval's theorem). Hence, using Ean.ll4l 
we see that the S/N for the unweighted case will be given by: 



S/N" 



2*1 2 li- ■ 



(15) 



In the weighted case, each point is multiplied by the appropriate 
weight and, assuming that variations in weight occur slowly compared 
to the periods of p modes of interest (so that the weight is constant over 
the period of the mode), the total signal power is reduced by a factor 
of 2 w^/N. Hence, the overall power, S , in the power spectrum would 
be: 



2 \ N 



(16) 



However, not all of that power will be contained within the main 
peaks of the spectrum as some will be removed into, for example, diur- 
nal sidebands. We can see the effect that weighting the time series has 
on the peaks in the power spectrum by making use of the convolution 
theorem. As the time-series we work with in the weighted case is the 
product of the weighting function w with the data y (see the argument 
at the end of Appendix 2), the transform will be the convolution of the 
transform of w with that of y. Firstly, we note that the transform of the 
purely sinusoidal y will just result in peaks of height a/2 at frequen- 
cies of ±l/P, where P is the period of the sinusoid. Secondly, while 
Wk (and hence its transform) is likely to be complicated, it takes the 
form of the sum of a complicated function, F, centred on zero with a 
d.c. level, D, which is given by: 



N ^ 



(17) 



Hence, making use of the linearity of Fourier Transforms, the trans- 
form of w will be the sum of transform of D, which is just a peak of 
height jj 2 with that of F. So the complete transform of wy will 
contain peaks of height ^ 2 frequencies of ± I/P and the total 
power in peaks at frequencies of ± I /P in the power spectrum will be 
H, given by: 



2 \ N 



(18) 



The S/N in the weighted case, 5 /N"', is determined by dividing 
Eqn.[Tlby Eqn.[T3l thus: 
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Figure 2. Top panel: Small slice of the power spectram of a 3650-day BiSON 
time series centered on the f=l, n=l 1 mode. The spectram has been divided by 
the underlying background. Overlaid on the spectram is the smoothing function 
designed to smooth the spectra over the two multiplet components. Bottom 
panel: The resulting smoothed spectram. 



S/N"-' 



2Nbi 



(19) 



In order to obtain separate values of the S/N in both the un- 
weighted and weighted cases we would need to know a'. However 
if we take the ratio of the two S/N's the a- terms cancel and we can 
obtain an estimate of the fractional improvement in the S/N for the 
weighted spectra compared with the unweighted one without the need 
to know a-, i.e. we can divide Eqn.|19|bv Egn.llSlto obtain: 



S/N" 



N^ 



(20) 



Firstly we note that this is always greater than 1 (see Appendix 3 
for a proof). I.e. with the assumptions made, weighting is always ad- 
vantageous. Secondly, for the real BiSON case we find that we should 
see a improvement in the S/N of around 50% if we weight the data. 
However, this is only a rough estimate since the amplitudes of the 
modes are not completely constant over time and the fact that the 
values of can only be estimated (see Section |4}- It turns out that 
the expected improvement based on these equations actually gives an 
overestimate of the true improvement we see (see section|6). Even so, 
Eas.ll0lthroughl20ldo help us better understand the effect of weighting 
the data. 

A more accurate method for estimating the improvement in the 
S/N is to look at the power spectrum directly. It is possible to fit the 
background at low and high frequencies in order to get an estimate 
of the power in the noise without the need to use EQS.Illlorll3land 
thus removing the need to estimate bk. This can be done for both the 
weighted and unweighted cases and can then be combined with Eg. 1181 
to obtain another estimate of how much the S/N increases when using 
weighted data. 



Although these predictions are useful in order to help discern the 
benefits from weighting the data, it is clear that we must look at the in- 
dividual S/N of the modes to get a full picture. One way of doing this is 
to fit each of the modes in the spectrum using a standard peak bagging 
approach. However, the modes at very low frequencies have extremely 
long lifetimes and consequently are very naiTow in the frequency do- 
main and are often difficult to fit. We therefore chose to employ a 
strategy whereby we smoothed the data (see below) and simply read 
off the maximum peak height. However, we also take into account 
that the modes are rotationally split. Hence, the smoothing function 
for the i = modes is a simple box car with a width equal to twice 
the expected linewidth of the mode. Whereas for the £ = 1 modes the 
smoothing function is a double box car with a spacing between them 
(measured from the centre of the boxes) equal to the rotational split- 
ting. The C = 2 and 3 modes have 3 and 4 box cars respectively to 
coincide with all the rotationally split multiplets. An example of this 
multiple box-car type smoothing, for an { = I mode, can be seen in 
Fig-El The big advantage of this approach is that, as well as smooth- 
ing the data, we also combine all the power from all the multiplets into 
a large peak at the central frequency of the mode. This has a distinct 
advantage when looking for modes with a wea k S/N. This method i s 
similar to the m-averaging method developed bv lSalabert et al. I (l2008h . 

It is also important to think about how to scale the data in or- 
der to compare more directly the power spectrum determined from the 
weighted and unweighted time series. Given that the aim of weight- 
ing the data is to reduce the background and hence improve the S/N, 
then the best way of scaling is to make the power of the signal (i.e 
the modes) the same in both cases. This can be done by fitting and 
removing the background in the two spectra and then summing the 
power over the region of the modes (say 1.3-5.5 mHz), and divid- 
ing the one total by the other in order to obtain a scaling factor. Or, 
alternatively, we can fulfil Parsevals's theorem by dividing both the 
unweighted and weighted spectra by 2 ^1 /N (where in the case of the 
unweighted data, wt would only take values of one or zero), although, 
as stated previously, this method will only be accurate if the amplitude 
of the modes do not vary much with time (i.e., if the signal is a pure 
sinusoid). 



6 RESULTS 

We now go on to present the effects of weighting the BiSON data on 
the power spectra and give the results on how much the S/N of the 
modes can be improved. In section l6n we look at the simulated data 
where we have a very clear understanding of how the noise function 
varies throughout the day and year. The simulated data also have the 
advantage that the input frequencies for the modes are known so we 
can calculate the S/N where the modes should be occurring even if 
those modes are too weak to be seen above the level of the noise. In 
section l6^ we give the results of weighting the real data. 

6.1 Simulated Data 

The top two panels in Fig. [3] show the power spectrum of the un- 
weighted and weighted time series of the simulated data. We have 
scaled the data by dividing by the mean of the squared weights in order 
to make the power in the modes the same in both plots (see Eq. I16t . 
We could also have chosen to scale the plots to give the same peak 
heights using Eq.[T8]or to give equal backgrounds using Eq.[T3] The 
plot shows that there is an overall reduction in the background level 
of the weighted spectrum. We remind the reader that for the simulated 
data we chose not to include a granulation like signal in order to more 
clearly observe any reduction in the background. (Hence the scales of 
Fig.[T]and Fig.[3]are not directly comparable.) The bottom two panels 
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give the corresponding spectral windows and show that the weighted 
spectrum has significantly more power removed from the main peak 
than the unweighted data. This means that the improvement due to the 
decrease in the background will be somewhat offset by the smaller 
height of the central peak. 

Using the mode-like smoothing function discussed in Section [5] 
we are able to compare the S/N of the individual modes. Note that 
since we know exactly where the modes should occur for the simulated 
spectrum, we take the S/N of the highest peak within a very restricted 
range of the known frequency (±0.05/iHz). Fig. |4] shows the S/N of 
the modes in the weighted spectrum divided by the S/N of the modes 
in the unweighted spectrum. The different symbols indicate different { 
and the solid line indicates the 'predicted' improvement in the S/N for 
the simulated data calculated by dividing the reduction in the power 
of the main mode peak by the reduction in the background. 

The plot shows that we can expect an improvement in the S/N 
of up to around 60% and the measurement of the S/N of the indi- 
vidual modes broadly agrees with this. However it is noticeable that 
at mid to high frequencies where the mode peaks are much stronger 
and broader the S/N improvement seems to be somewhat better than 
expected. This was found to be an effect of the mode width. Look- 
ing closely at the bottom panels of Fig. [3] it can be seen that there is 
power in the bins surrounding the central peak. The effect of convolv- 
ing a wide peak with such a spectral window is to give a peak in the 
power spectrum whose height is somewhat greater than if both func- 
tions had been 'narrower'. In the case of the weighted data there is 
more power surrounding the main central peak than in the unweighted 
data. Hence the effect on the peak's height is greater and we see an in- 
crease in the ratios of the S/Ns. The effect of this is further increased 
by the fact that we are averaging over the power in the modes when 
applying the mode-like smoothing function described in section |5] In 
contrast at low frequencies the improvement in the S/Ns is less than 
expected. This reflects the fact that the mode heights fall significantly 
at low frequencies and thus some modes may still not be seen above 
the background level (hence we will not be able to measure the true 
improvement in the S/N). Also, since the data is being weighted by 
the noise level at low frequencies, it is likely that as well as reducing 
the background level there will also be some reduction in the power of 
the low frequency modes. 

In Section|4]we explained how the noise function that the weight- 
ing is based on could be determined. This noise function varies both 
throughout the day and throughout the year due to the changing line- 
of-sight velocity between the stations and the Sun and also due to the 
different noise characteristics of the various instruments. Tracking the 
differences between instruments and other long term variations can be 
done by measuring the noise in the power spectra from daily time se- 
ries from each station. However daily variations can not be measured 
in this way. Instead we must use the simulator to predict the variations 
and superimpose these onto the mean values for each day. However, 
the difference between using a weighting function with daily varia- 
tions compared to one without turned out to be very small. Fig.|4]gives 
the S/N of individual modes when fixed values are used throughout 
the day. Using a varying value improved these results by less than 1%. 
This suggests that when weighting the real data it is not important to 
take into account the impact of the daily variations of the noise. This 
is because the absolute magnitude of the daily variation will be pro- 
portional to the mean noise over the day. Therefore, the daily variation 
will only be of a similar order to the difference in noise between the 
stations when the noise itself is large. However, when the noise is large 
the weighting is small and so the overall impact of the daily variation 
on the power spectrum ends up being small. 

In the preamble of this section we also discussed that we chose 
to weight the time series as the inverse of the noise. This seems to be 
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Figure 4. Ratio of the S/N at known mode frequencies for the simulated 
weighted and unweighted data. The symbols represent different t, with ^=0 
given by diamonds, 1=1 by triangles, t=2 by squares and /'=3 by circles. The 
solid line shows the 'predicted' improvement in S/N (see text) and a dotted line 
is given at unity to aid the eye. 

the best option. When taking the inverse of the square of the noise or 
the inverse of the square root of the noise the improvement in the S/N 
is not as good. 

In order to help confirm the detection of modes statistical tests 
are often employed. One such method is to calculate the probability 
of any particular pe ak in the spectrum being solely due to noise (see 
IChaplin et al1l2002h . In order to do this we need to know the under- 
lying noise distribution. For p-mode power spectra made from long 
uninterrupted time series this is known to to be negative exponential 
(i.e.,;^^^ with 2 degrees of freedom). Simulations were performed and 
showed that such a distribution remains valid even when applying the 
weighting function to the time series'. 

Therefore, the probability, p, of getting, by chance, a spike of 
power, (, in any particular bin is: 

P(v) = exp(-A), (21) 

where {(y) is an appropriate average background level. From this the 
probability, P, of there being at least one peak greater than or equal to 
^ over a range of N bins can be calculated; 

P = 1 - (1 - p{y)f (22) 

The value of A' must be carefully chosen by the user as its value will 
affect the calculation of P . Discussion of the best value to use has 
alread y been given in both lChaplin et al.] ( l2002h and lBroomhall et ^ 
Here we choose a value of A'^ equal to the number of bins in a 
100 uYiz slice of our sp ectra. This is the same frequency range used in 
iBroomhall et al.l ( |2007|) . 

Tests can be performed in order to check for the occurrence of 
power over a range of bins (since the power in the modes will gener- 
ally be spread over many bins given long time series) and for power 
arranged in the various mode like structures (i.e the rotational split- 
ting patterns for t > 0). As discussed earlier we haven taken a slightly 
different approach and smooth the peaks using a mode-like smoothing 
function. Therefore, we can simply apply Eq. |22]for the highest peak 
after we have taken the convolution. However, it should be noted that 
in calculating p, the fact that we are averaging over a number of bins 

' Strictly, the use of the with 2 degrees of freedom distribution is limited to 
situations in which the noise in the time series is stationary, which, in practice 
for BiSON data, it never is. In the unweighted case the noise changes because 
BiSON data are taken by different instruments. In the weighted case, both noise 
and signal change. Therefore, pragmatically, before carrying out peak detection 
methods that employ statistical tests, we always check the underlying distribu- 
tion. 
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Figure 3. Smoothed power spectra of the unweighted (top left) and weighted (top right) simulated time series. The data has been scaled so that the power in the 
modes is the same in both plots, however, more of that power will be aliased out into side bands in the weighted case. Dotted and dashed lines shows the underlying 
background in the unweighted and weighted cases respectively. Bottom panels show the spectral windows for the two power spectra. 



Table 1. Table giving the frequencies, v, for the simulated data, of the known 
modes for low n and the probabilities, P, of the power at those frequencies 
occurring, by chance, over a range of 100 ^Hz. Peaks which are strong enough 
to have only a low probability of occurring by chance (P < 0.1) are printed in 
bold. Modes which are identified by this means in the weighted spectrum only 
are denoted by an asterisk. 



i 


n 


Vu (pHz) 


Pu 







7 


1118.02 


1.00 


1.00 





8 


1263.49 


1.00 


0.00 * 





9 


1407.47 


0.42 


0.00 * 





10 


1548.42 


0.00 


0.00 





11 


1686.74 


0.00 


0.00 




7 


1185.60 


1.00 


1.00 




8 


1329.64 


0.00 


0.00 




9 


1472.92 


0.00 


0.00 




10 


1612.60 


0.00 


0.00 




11 


1749.33 


0.00 


0.00 


2 


6 


1105.25 


1.00 


1.00 


2 


7 


1250.81 


1.00 


0.77 


2 


8 


1394.61 


0.00 


0.00 


2 


9 


1535.93 


0.00 


0.00 


2 


10 


1674.58 


0.00 


0.00 


3 


6 


1161.74 


1.00 


1.00 


3 


7 


1306.85 


0.35 


0.09 * 


3 


8 


1451.01 


0.08 


0.00 


3 


9 


1591.42 


0.00 


0.00 


3 


10 


1729.14 


0.00 


0.00 



level of power in the tested bin is due solely to the noise distribu- 
tion. In the case of simulated data, where we know a mode should be 
present, this indicates the S/N of this mode is too low to be seen above 
the noise level. When P is close to zero, there is only a small chance 
that that level of power is due to noise. A value of P < 0.1 (given in 
a bold typeface in the table) means there is less than 10% chance of 
the peak being due solely to noise and as such usually warrants being 
investigated further. 

For regions of the spectrum where there is known to be a mode 
there are three occasions where P is below the 0.1 threshold in the 
weighted data but not in the unweighted data (these modes are denoted 
by an asterisk in Table[TJ. Conversely there are no instances where the 
opposite is true. This suggests that not only is there a clear increase 
in the S/N when using the weighted power spectrum, but also that this 
increase may allow us to uncover more yet unidentified modes in the 
real BiSON data. 

It is useful to estimate the value of the S/N at which we would 
expect the weighted power spectrum to give a detection when the un- 
weighted spectrum does not. We estimated this by gradually increas- 
ing the power of the f = mode at 1 1 18.02 yuHz in the simulated data 
and seeing whether or not we obtained a detection in both the weighted 
and unweighted cases. We made 20 realizations for each mode power 
in order to get an average value of the S/N. From this test we ascer- 
tained that an average S/N of a little above 2.0 was required in order 
for a single mode to be detected. Therefore, if a mode in the weighted 
spectrum has a S/N of about 2.0 we expect to get a positive detection 
whilst in the unweighted case the S/N is likely to be only about 1.7 
(see Fig.|7}, and a detection is not expected. 



must be accounted for. It can be shown that calculating the probability 
of having a power, s, averaged over Nav bins for a.x^ distribution with 
D degrees of freedom is equivalent to finding the probability of hav- 
ing power, s, in a single bin for dix^ distribution with DNav degrees of 
freedom (see Appourchaux 2004). 

Table[T]gives P at the stated frequencies (which are the known in- 
put values for the modes) for both the unweighted and weighted cases. 
A value of P close to unity indicates there is high probability that the 



6.2 Real Data 

Fig.[5]shows the power spectrum and spectral windows of a real 10- 
year BiSON time series for both the weighted and unweighted data. 
Again we have scaled the plots so that the power in the modes are the 
same. The background signal, n, as a function of frequency, v, was 
parameterised and fitted using a power-law model (lHarvevlll985l) . A 
two-component power law was used, i.e.. 
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n(v) 



l+ivTiY 1+(VT2)* 



(23) 



where T[ and cti represent the time constant and standard deviation 
respectively of the granulation signal, T2 and 1X2 give the same param- 
eters for the meso-and super-granulation component and a and h are 
the respective power law exponents, although we chose to fix b with a 
value of 2. While we cannot be sure that a Harvey model will still be 
the best model for the background in the weighted case, it can still be 
used as a guide to the eye to show the improvement. However, we note 
that due to the weighting process the fit to the background is unlikely 
to give reliable estimates for T2 and 1T2. 

The fits to the background clearly show that there is a signifi- 
cant reduction in the background level in the weighted case (but of 
course, even if our process were perfect, could only reduce the noise 
level to that of the lowest obtained in the full data set). However, even 
though the real data were weighted according to the noise in the low 
frequency region of the spectrum, it is the highest frequencies where 
the greatest reduction in the noise level is seen. This is because much 
of the noise component at low frequencies is made up of the granu- 
lation background signal which does not vary significantly with time. 
In contrast, at high frequencies the granulation background will have 
dropped almost to zero meaning there will be a greater impact on the 
noise due to the weighting. Exactly how much of the low frequency 
background is instrumental and how much is solar is difficult to ascer- 
tain. The instrumental noise is clearly not independent of frequency. If 
one removes the high frequency background from the low frequency 
background there are still significant variations in the noise amplitude 
over time. This suggests a higher level of instrumental noise at low 
frequencies than at high frequencies. The spectral windows, shown in 
the bottom panels of Fig. |5] look similar to those for the simulated 
data, which is to be expected given the artificial noise was designed 
to mimic the real noise. While Fig. |5] shows a reduction in the back- 
ground for the weighted case, it is not clear how the S/N of the low 
frequency modes are affected. This is shown more clearly in Fig. [6] 
where close-up views of the spectra over the region 800 - 1800 yuHz 
are given. It can be seen from this plot that the modes in the weighted 
spectrum have a higher S/N and that there are more visible modes at 
low frequencies. 

Fig.Qshows the ratio of the S/N's for the real BiSON data. They 
have been calculated using the same smoothing process as described 
in section [5] However, since we do not know the true frequencies of 
the modes in the real data, we compare the S/N's of the maximum 
peak height in the vicinity of where the modes are predicted to lie. 
These predictions are based on the estimates returned from model 'S' 
jChristensen-Dalsgaard et al.lll996l) . A gain we see an increase in the 
S/N when using the weighted data, however this improvement is less 
than was seen in the simulated case. We also note that the prediction is 
less than for the simulated case because although the noise character- 
istics are similar in the two cases they are not identical. With the real 
data we have plotted down to lower frequencies than for the simulated 
data (for which we knew there were no modes below 1 100 yuHz). And 
it seems that even below 1000 ^Hz there is evidence for improvements 
in S/N for the weighted spectrum. Of course this may just be due to 
randomly higher noise spikes and so we again need to perform statis- 
tical tests to see how likely the peaks are to be due solely to noise. 

Table[2]gives the frequencies of the highest peaks, and as can be 
seen these values are not necessarily the same in the unweighted as 
they are in the weighted cases, although where P < 0. 1 for both, the 
values are generally very close. There are three distinct cases where 
there is a peak within the vicinity of a predicted mode that has a P 
below the chosen threshold in the weighted spectrum but not in the 
unweighted one (denoted by an asterisk in Table|2j. These are the £=2, 
n=5 and n=7 modes and the {=3, n =7 mode. In the case of the i=2. 
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Figure 7. Ratio of the S/N for the highest peak in the vicinity of predicted 
modes frequencies for the real weighted and unweighted data. The symbols 
represent different t, with t=0 given by diamonds, t=\ by triangles, t=2 by 
squares and f=3 by circles. The solid line shows the 'predicted' improvement 
in S/N (see text) and a dotted line is given at unity to aid the eye. 



Table 2. Table giving the frequencies, v, for the real BiSON data, of the highest 
peaks after convolution (see section |5) in regions surrounding the areas where 
modes are expected to occur for both the unweighted, u, and weighted, »', data. 
Also given are the probabilities, P, of such peaks occurring, by chance, over 
a number of bins covering a range of 100 /jHz. Peaks which ai'e high enough 
to have only a low probability of occurring by chance (P < 0. 1) are printed in 
bold. Modes which are identified by this means in the weighted spectrum only 
are denoted by an asterisk. The frequency given for the i=2, n=5 mode denoted 
by the t symbol is likely to be that of the m=-2 component only 



c 


n 


Vi, (uHz) 


P. 


v„ (^Hz) 







6 


972.61 


0.69 


972.61 


0.46 





7 


1119.09 


1.00 


1118.20 


1.00 





8 


1263.22 


0.00 


1263.19 


0.01 





9 


1407.49 


0.00 


1407.49 


0.00 





10 


1548.34 


0.00 


1548.34 


0.00 




6 


1040.93 


1.00 


1038.77 


0.52 




7 


1185.60 


0.06 


1185.60 


0.00 




8 


1329.63 


0.00 


1329.62 


0.00 




9 


1472.83 


0.00 


1472.82 


0.00 




10 


1612.70 


0.00 


1612.70 


0.00 


2 


5 


961.25 


0.98 


958.77 


0.03 *t 


2 


6 


1106.21 


1.00 


1104.54 


0.86 


2 


7 


1249.50 


0.98 


1251.39 


0.02 * 


2 


8 


1394.69 


0.00 


1394.70 


0.00 


2 


9 


1535.86 


0.00 


1535.85 


0.00 


3 


5 


1016.16 


0.95 


1013.96 


0.79 


3 


6 


1161.26 


1.00 


1162.71 


0.81 


3 


7 


1308.16 


0.79 


1306.66 


0.08 * 


3 


8 


1450.96 


0.00 


1450.99 


0.00 


3 


9 


1591.48 


0.00 


1591.60 


0.00 



11=5 mode this has never before been identified in the power spectra of 
any sun-as-a-star data, although given its location, it does seem likely 
that we have only detected the m = -2 component. 



7 SUMMARY 

We have taken a long 3650-day time series of BiSON data and ap- 
plied a weighting function in order to improve the signal-to-noise ratio 
(S/N) in p-mode power spectrum. The weighting function was based 
on the noise levels at low frequencies seen in the power spectra of 
daily time series from each of the six BiSON stations. A sophisticated 
simulation was used to track how the noise varies throughout the day. 
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Figure 5. Smoothed power spectra of the unweighted (top left) and weighted (top right) real BiSON time series. The data has been scaled so that the power in the 
modes is the same in both plots, however, more of that power will be aliased out into side bands in the weighted case. Dotted and dashed lines shows fits to the 
background in the unweighted and weighted cases respectively. Bottom panels show the spectral windows for the two power spectra. 




Figure 6. A close up view of the power spectra of the unweighted (top) and weighted (bottom) real BiSON time series over the region 800 - 1800 /iRz . The data has 
been scaled in the same way as in Fig 5. but has not been smoothed. There are clearly more modes visible in the weighted spectrum compared with the unweighted. 



but it was found that including this daily effect in the weighting ac- 
tually gave little further improvement in the S/N. The sophisticated 
simulator was also used to create a matching set of artificial data in 
order to help better understand the improvements that the weighting 
can bring. 

In order to analyse these very long data sets it was necessary 
to develop a new method for generating the power spectrum of the 
weighted time series. A traditional sine- wave-fitting method would 
have taken many weeks to complete for a 3650-day time series, even 
when using a high performance computing cluster. Instead we devel- 



oped a method where certain summations used in the sine-wave fitting 
process are replaced with far more computationally efficient FFT's. 
This reduced the time to compute the weighted FFT of the 3650-day 
time series to less than an hour on a single desktop machine. 

It was shown that the weighted power spectrum had a signifi- 
cantly lower background level than the unweighted case. However, we 
also showed that the spectral window of the weighted data was signif- 
icantly poorer. This means that more power is aliased out of the main 
central peak into a pseudo-random background structure and promi- 
nent sidebands. This is because the application of the weighting func- 
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tion essentially acts as a 'dirty' window function. The combined result 
of these two effects was that we expected an improvement in the S/N 
of the modes of around 30% for the real data. By measuring the the 
actual S/N of the peaks in both the unweighted and weighted spectrum 
this was shown to be an underestimate for high frequency modes but 
an overestimate at low frequencies. The actual improvements in the 
S/N for the low frequency modes seems to be about 15%-20%. 

Despite this fairly modest improvement there were three more 
additional peaks that passed the chosen threshold for a statistical test 
in the weighted data compared with the unweighted data. These were 
the {=2, n=5 and n=l modes and the {=3, n =7 mode. If confirmed 
then in the case of the {=2, n=5 and f=3, n=l modes these would both 
be new identifications for BiSON data. While the detection of the l=2i, 
11=7 mode was perhaps not unexpected given its expected amplitude, 
a possible detection of the £=2, n=5 mode is more surprising. The 
frequency of the peak is about IpHz below what is predicted from the 
solar models. However this does not rule it out as a potential mode as 
we could be seeing only the m = —2 component. It should be noted 
that the C=Q, n=6 mode which is not seen in either the unweighted or 
weighted spectra has actually been detected in other earlier BiSON 
data sets. It seems that this particular mode was excited to greater 
amplitudes in the past than it is currently and hence tends to appear 
only in earlier stretches of data. 

It is clear that by weighting the BiSON time series we gain a sig- 
nificant improvement in the S/N of the modes and it seems likely that it 
will be possible to identify more low frequency modes using this tech- 
nique. We hope to do further studies using this weighting method with 
different time spans and longer BiSON time series in future. There is 
also scope for performing a search of contemporane ous BiSON and 
GOLF data in a similar fashion to that explained in Broomhall et alj 
but using weighted data instead of unweighted. 



APPENDICES 

7.1 Appendix 1 The SWF formulae 

In this Appendix we present the derivation of Equations 4 and 5 in 
Section 3. If we fit: 



Ai sm{2nfjh) + Bi cos{2nfitt). 



(A-1) 



to the data, yj., at each frequency, where tj^ is the time of the A;th 
observ ation, then the method of leas t squares (see, for example, page 
144 of lBevington & Robinsorjll992h allows us to find the values of A, 
and Bi by minimising: 

Yj = Y}^''^'' ~ - B,cos{2T:f,h)f) (A-2) 

with respect to A, and S,. Here w^. represents the statistical weight of 
the feth observation. 

It is useful to simplify the notation by setting Sjk = sin(2;r/;/j.) 
and Ctt = co^{2nfitk). Taking the partial derivatives of the right hand 
side of Equation A-2 with respect to A; and S, and setting the results 
equal to zero to identify the minima, we obtain: 



and 
35, 



^(H'A-[-VtCrt -I- B,cfj. + A,.S,tC,t]) = 0. 



(A-3) 



(A-4) 



These two equations may be solved simultaneously to obtain Equa- 
tions 4 and 5 in Section 3. 



7.2 Appendix 2 Derivation of Equations 6 and 7 

We aim to evaluate the power spectrum of the time series using a sine- 
wave fit, but with any efficiencies we can use in order to make the 
calculation less computationally intensive. All the way through this 
analysis we assume that the time-series has been sampled at a regular 
cadence (so that its Fast Fourier Transform (EFT) can be calculated) 
and that the fitted frequencies are those that would have been returned 
by a EFT of the time-series. So for each frequency, one fits: 



A, sin(2;ry;4) + B, cos(2n fit^). 



(A-5) 



to the data, yt, where is the time of the ^th observation. 

A, and fi, can be evaluated by l east squares fitt i ng. The appro- 
priate expressions are (Appendix 1 ; iKieldserj 1 19921 : iFrandsen et alj 
1995): 



Al- 



and 



B, 



(2 WkSikCikf - 2 WkCl 2 WkSl 



(A-6) 



(A-7) 



where = sin(2;r/ifi), c,x. = cos(2nfiti,) and wi, is the statistical 
weight of the kth measurement. 

The power spectrum can then be obtained by evaluating Aj + Bj 
as a function of This takes significantly longer to compute than, say, 
a power spectrum that uses an EFT approach, because each unique 
summation in the expressions for A, and S, must be evaluated for each 
frequency, i.e, for each value of /. 

It is, however, possible to make use of Fourier Transform prop- 
erties so that the summations required can be evaluated using EFT 
methods, thus speeding up the process of sine-wave fitting. To see 
how this works, we start from the definition of the Fourier Transform 
and then identify its real and imaginary parts with integrals over sines 
and cosines. 

The Fourier transform, G(/), of a function of time, ^(0, is defined 

by: 

G(f) = j g{t)e-'-'"fdt (A-8) 

= £,(0cos(2../0..-;£g(0sm(2.,/0... 

If g(t) represents a set of measurements (i.e., git) is always real), then 
the two integrals above are always real and we can write: 



G(/) = Re[G(/)] + Im[G(/)], 
where 

Re[G(/)] = J g{t)w^{2mft)dt, 
and 

Im[G(/)] = - J git) im{2nift)dt. 



(A-9) 



(A-10) 



(A-11) 



G(/) can be evaluated using EFT methods and the integrals above can 
be replaced by summations if g{t) is a discrete function of time. To 
link this explicitly to our situation, we can define the k\h element of g 
by: 



g(tk) = Wkyk, 



(A-12) 



and therefore two of the summations we need to evaluate (see Equa- 
tion A-6 and A-7) are given by: 



Y ^tytc^ik = NRe:[FFT(wkyk)], 



(A- 13) 
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and 

Yj ^kykS.k = -Mm[FFT(wt.Vt)], (A-14) 

The numerical factor of N arises in Equations A- 13 and A-14 
if we adopt the definition of the FFT used, for example, by the IDL 
language, i.e. if the FFT of x{t) is defined by: 



VVT{x(t)) = i Y}x(t). 



(A- 15) 



= Re[FFT(x(f))] + ;im[FFT(x(0)] 



The remaining three summations in Equations A-6 and A-7 can easily 
be approximated. Firstly, the summations over sc, s^ and c" can be 
expressed in terms of sinusoidal functions of the double angle, i.e.. 



^ WkSikCik = 2 Y sin(2 X In fit,,). 



(A- 16) 



(A-17) 



(A- 18) 



Secondly, we note that the the summation in Equation A-16 and the 
second terms in the summations of Equations A-17 and A- 18 essen- 
tially "pick out" the harmonic content of Wk- They will be very nearly 
zero when the frequency under consideration is well away from any 
frequency content of wt. This is true in the BiSON case, where the 
changes in Wj^ are very much slower than changes in due to solar p 
modes. This means that for all frequencies of interest the first terms in 
each of Equations A-17 and A- 18 will be dominant and constant. So to 
a very good approximation. Equations A-6 and A-7 can be re-written:- 

Mm[FFT(wo'*)] 



Al- 



and 



2(^^/2) 



ATle[FFT(w*y*)] 



(A-19) 



(A-20) 



2(^*72) 

These are Equations 6 and 7 in the main text. In this approxima- 
tion we see that the usual method of calculating the power spectrum 
after performing a SWF, i.e. by evaluating + B], yields the same 
result as taking the modulus-squared of the FFT of wj.yj. (to within a 
numerical multiplier of AN' l(Lwkf'). The multiplier equals 4 in the 
unweighted case (i.e. where all the wt equal unity). The discussions of 
Section 5 rely on the fact that in the weighted case we are essentially 
dealing with a time series given by wtvt- 



7.3 Appendix 3 Limiting value of Equation 20 

We begin by noting that the product of the summations has terms, 
A' of which are just given by w^^/w^^ (each of which is equal to 1 of 
course) and N^-N of which are wj/wk - The latter terms can be "paired 
up", so that there are (A'^ - N)/2 terms of the kind Wjiwk + w^/wj. If 
each of these terms is greater than 2, then the product of 'Z.'^'E'^j 
will be greater than A''. To show that this is indeed the case, we begin 
by defining; 



Xjk = WjIWk, 

so that each paired term, tj^, is given by: 
1 



Xjk 



(A-21) 



(A-22) 



By differentiating t with respect to x it can easily be shown that 
tjt has a minimum value of 2 when xj^ = 1. Hence: 



if all the Xj^ are equal to I, i.e. if all the data points have equal weight, 
and is otherwise greater than N^, in which case the LHS of Ean.[20lis 
greater than I . 
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